server room
If Anyone Builds it, Everyone Dies review – how AI could kill us all
W hat if I told you I could stop you worrying about climate change, and all you had to do was read one book? Great, you'd say, until I mentioned that the reason you'd stop worrying was because the book says our species only has a few years before it's wiped out by superintelligent AI anyway. We don't know what form this extinction will take exactly - perhaps an energy-hungry AI will let the millions of fusion power stations it has built run hot, boiling the oceans. Maybe it will want to reconfigure the atoms in our bodies into something more useful. There are many possibilities, almost all of them bad, say Eliezer Yudkowsky and Nate Soares in If Anyone Builds It, Everyone Dies, and who knows which will come true.
- North America > United States (0.17)
- Oceania > Australia (0.05)
- North America > Mexico (0.05)
- Europe > Ukraine > Kyiv Oblast > Chernobyl (0.05)
- Government (1.00)
- Leisure & Entertainment > Sports (0.71)
- Health & Medicine (0.67)
Data Center Cooling System Optimization Using Offline Reinforcement Learning
Zhan, Xianyuan, Zhu, Xiangyu, Cheng, Peng, Hu, Xiao, He, Ziteng, Geng, Hanfei, Leng, Jichao, Zheng, Huiwen, Liu, Chenhui, Hong, Tianshun, Liang, Yan, Liu, Yunxin, Zhao, Feng
The recent advances in information technology and artificial intelligence have fueled a rapid expansion of the data center (DC) industry worldwide, accompanied by an immense appetite for electricity to power the DCs. In a typical DC, around 30~40% of the energy is spent on the cooling system rather than on computer servers, posing a pressing need for developing new energy-saving optimization technologies for DC cooling systems. However, optimizing such real-world industrial systems faces numerous challenges, including but not limited to a lack of reliable simulation environments, limited historical data, and stringent safety and control robustness requirements. In this work, we present a novel physics-informed offline reinforcement learning (RL) framework for energy efficiency optimization of DC cooling systems. The proposed framework models the complex dynamical patterns and physical dependencies inside a server room using a purposely designed graph neural network architecture that is compliant with the fundamental time-reversal symmetry. Because of its well-behaved and generalizable state-action representations, the model enables sample-efficient and robust latent space offline policy learning using limited real-world operational data. Our framework has been successfully deployed and verified in a large-scale production DC for closed-loop control of its air-cooling units (ACUs). We conducted a total of 2000 hours of short and long-term experiments in the production DC environment. The results show that our method achieves 14~21% energy savings in the DC cooling system, without any violation of the safety or operational constraints. Our results have demonstrated the significant potential of offline RL in solving a broad range of data-limited, safety-critical real-world industrial control problems.
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Asia > China > Shanghai > Shanghai (0.04)
- Energy (1.00)
- Construction & Engineering > HVAC (1.00)
Robustness Assessment of Mathematical Reasoning in the Presence of Missing and Contradictory Conditions
Tian, Shi-Yu, Zhou, Zhi, Jia, Lin-Han, Guo, Lan-Zhe, Li, Yu-Feng
Large language models (LLMs) have demonstrated impressive performance on reasoning tasks, which can be further improved through few-shot prompting techniques. However, the current evaluation primarily focuses on carefully constructed benchmarks and neglects the consideration of real-world reasoning problems that present missing and contradictory conditions, known as ill-defined problems. Our observations suggest that existing few-shot prompting techniques are ineffective in such scenarios, often providing overconfident answers or hallucination. To further study this problem, we develop a benchmark called Problems with Missing and Contradictory conditions (PMC) and introduce two novel metrics to evaluate the performance of few-shot prompting methods in these scenarios. Our analysis using the PMC benchmark reveals a trade-off dilemma between the performance of mathematical reasoning for well-defined problems and the ability to recognize ill-defined problems. To address the challenges posed by PMC, we propose a novel few-shot prompting method called SMT-LIB Prompting (SLP), which utilizes the SMT-LIB language to model the problems instead of solving them directly. Subsequently, a double-check solving strategy checks the satisfiability and uniqueness of the solution and provides final feedback. Extensive experiments demonstrate the superiority of our SLP approach compared to existing few-shot prompting methods when dealing with problems with missing and contradictory conditions. We will open-source our benchmark and code to facilitate future research.
- Law (0.67)
- Leisure & Entertainment > Sports > Basketball (0.47)
- Education (0.46)
Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation
Phan, Phuc, Tran, Hieu, Phan, Long
We propose a straightforward approach called Distillation Contrastive Decoding (DCD) to enhance the reasoning capabilities of Large Language Models (LLMs) during inference. In contrast to previous approaches that relied on smaller amateur models or analysis of hidden state differences, DCD employs Contrastive Chain-of-thought Prompting and advanced distillation techniques, including Dropout and Quantization. This approach effectively addresses the limitations of Contrastive Decoding (CD), which typically requires both an expert and an amateur model, thus increasing computational resource demands. By integrating contrastive prompts with distillation, DCD obviates the need for an amateur model and reduces memory usage. Our evaluations demonstrate that DCD significantly enhances LLM performance across a range of reasoning benchmarks, surpassing both CD and existing methods in the GSM8K and StrategyQA datasets.
- North America > United States > Pennsylvania (0.04)
- Asia > Vietnam (0.04)
- North America > Canada > Ontario > Toronto (0.04)
Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters
Wang, Boshi, Min, Sewon, Deng, Xiang, Shen, Jiaming, Wu, You, Zettlemoyer, Luke, Sun, Huan
Chain-of-Thought (CoT) prompting can dramatically improve the multi-step reasoning abilities of large language models (LLMs). CoT explicitly encourages the LLM to generate intermediate rationales for solving a problem, by providing a series of reasoning steps in the demonstrations. Despite its success, there is still little understanding of what makes CoT prompting effective and which aspects of the demonstrated reasoning steps contribute to its performance. In this paper, we show that CoT reasoning is possible even with invalid demonstrations - prompting with invalid reasoning steps can achieve over 80-90% of the performance obtained using CoT under various metrics, while still generating coherent lines of reasoning during inference. Further experiments show that other aspects of the rationales, such as being relevant to the query and correctly ordering the reasoning steps, are much more important for effective CoT reasoning. Overall, these findings both deepen our understanding of CoT prompting, and open up new questions regarding LLMs' capability to learn to reason in context.
- Oceania > Australia (0.04)
- North America > United States > Ohio (0.04)
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
- (4 more...)
- Media > Film (0.47)
- Leisure & Entertainment (0.47)
The elephant in the server room
Suppose you would like to know mortality rates for women during childbirth, by country, around the world. One option is the WomanStats Project, the website of an academic research effort investigating the links between the security and activities of nation-states, and the security of the women who live in them. The project, founded in 2001, meets a need by patching together data from around the world. Many countries are indifferent to collecting statistics about women's lives. But even where countries try harder to gather data, there are clear challenges to arriving at useful numbers -- whether it comes to women's physical security, property rights, and government participation, among many other issues.
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.40)
- Oceania > Australia (0.05)
- North America > Canada (0.05)
- Europe > Western Europe (0.05)
- Law (1.00)
- Commercial Services & Supplies > Security & Alarm Services (0.55)
What used to take up a whole server room, is taking us to another realm of communication
When most people think of artificial intelligence, they think of Cylons, or Terminators, or HAL--sentient robots who turn on their masters in an effort to destroy the human race. But companies like SwiftKey, who are making helpful, smartphone-compatible artificial intelligence apps, are trying to change that. SwiftKey Neural Alpha is the world's first artificially intelligent smartphone keyboard. The keyboard uses machine learning to help predict what words the user is likely to enter by understanding the way the user puts sentences together. "Human language itself is a pattern," said Michael Smith, VP of Product at SwiftKey.
- Information Technology > Artificial Intelligence (1.00)
- Information Technology > Communications > Mobile (0.77)